SYSTEM AND METHOD FOR 
IMAGE COMPRESSION, STORAGE, AND RETRIEVAL 



RELATED APPLICATION 
This application claims the benefit of U.S. Provisional Application Serial No. 60/407,163 
filed on August 30, 2002 the disclosure of which is hereby incorporated by reference in its 
entirety. 

FIELD OF INVENTION 
This invention relates to image compression. The invention also relates to systems and 
methods for storing compressed images and retrieving compressed images. 

BACKGROUND 

It is known in the art to use compression techniques to make graphics Web friendly. For 
example, the Web page "http://web.utk.edu/~cwiek/watchl" describes using Adobe 
PHOTOSHOP software to make graphics images smaller in terms of download time. To this 
end, the Web page describes using various techniques available in the PHOTOSHOP software to 
reduce the size of a JPG file. Specific examples set forth in the Web page include changing the 
resolution of an image, reducing the number of colors in an image, and/or changing the print size 
of an image to reduce the size of the image file and, thereby, allow for the faster downloading of 
the image file. 

SUMMARY OF THE INVENTION 
A system and method for compressing and storing image data. The system and method 
creates for each of a plurality of original images a plurality of resultant images by altering the 
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content of each of the plurality of original images a corresponding plurality of different ways 
(e.g., by applying none, one, or combinations of imaging techniques to the original image in a 
plurality of different combinations and permutations). Each of the resultant images is then 
compressed. From the compressed, resultant images created from each of the plurality of the 
original images one resultant image is selected. Each of the selected one of the plurality of 
resultant images is then stored such that each of the selected one of the plurality of resultant 
images is retrievable to be displayed as a representation of its corresponding original image. 

A better understanding of the objects, advantages, features, properties and relationships 
of the invention will be obtained from the following detailed description and accompanying 
drawings which set forth illustrative embodiments that are indicative of the various ways in 
which the principles of the invention may be employed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a better understanding of the invention, reference may be had to preferred 
embodiments shown in the following drawings in which: 

Figure 1 illustrates a flow chart diagram of an exemplary method for creating one or 
more compressed versions of an original image; 

Figures 2 and 3 illustrate flow chart diagrams of an exemplary method for creating a 
concatenation file and look-up table; and 

Figure 4 illustrates an exemplary concatenation file and look-up table created by the 
method generally illustrated in Figs. 2 and 3. 



DETAILED DESCRIPTION 



With reference to the figures, a system and method for modifying and preparing images 
for file compression, storage, and retrieval is disclosed. Generally, image compression formats, 
such as GIF, JPG, and PSD (PHOTOSHOP image format), are non-adaptive techniques for 
compressing images. That is, the compression algorithms utilized are applied in the same way 
5 for all images. However, by altering the content of an image before it is compressed (e.g., by 
using imaging techniques to change one or more of its orientation, coloring, sharpness, etc.), 
improved image compression ratios can be achieved resulting in reduced storage requirements 
while still maintaining the usefulness of the images. Moreover, by storing these better 
compression-ratio images in a particular data structure, storage requirements can be further 
10 reduced. 

Achieving a better compression-ratio and reducing storage requirements are particularly 
useful in solving problems associated with storing information in a hand-held computing device. 
For example, the Compaq iPAQ brand hand-held device has memory expansion chips of limited 
size. For example, chips are available having 12M at a present cost of approximately $700 and 

1 5 128M at a present cost of approximately $ 1 00. Thus, it will be appreciated that, when cost is a 
concern, efficiently utilizing smaller sized memory chips is desirable. Using the compression, 
storage, and retrieval techniques described hereinafter, a more efficient utilization of limited 
memory can be achieved. In this regard, the techniques described hereinafter have been 
successfully utilized to store on a memory chip of limited capacity an electronic catalog 

20 comprised of product information for approximately 100,000 products including approximately 
30,000 images. 

While the description that follows preferably stores images using the GIF compression 
format, it is to be appreciated that this is not intended to be limiting. Rather, any compression 
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technique can be utilized. Furthermore, while described in the context of utilizing functions 
found in the Adobe PHOTOSHOP brand imaging software, it will be appreciated that other 
commercially available imaging software applications can also be utilized. 

To reduce the amount of memory required to store the image files, the physical image 
5 display-size for each of the original images may be initially adjusted. This size adjustment may 
be vertically and/or horizontally utilizing conventional imaging techniques. Given the intended 
target platform of a hand-held computing device, such as the Compaq brand Pocket PC, the size 
adjustment can be used to create a resultant image that has a size that would allow the resultant 
image to be displayed in the display of the hand-held device without requiring scrolling. Since 

10 image-dimension adjusting alone may be insufficient to allow all of the resultant images, when 
compressed, to be fit within the memory of the hand-held device, along with any corresponding 
textual information, further imaging techniques may be employed to further reduce the 
compressed file size of the resultant images. 

A further method for reducing the amount of memory required to store image files 

1 5 consists of manipulating the reduced or original images (hereinafter collectively referred to as 
the original image) using techniques available in the imaging software, such as rotating the 
image, sharpening the image, re-sampling the image, etc., before compressing the image (e.g., 
storing the manipulated and uncompressed image as a GIF or the like type of file). By using the 
techniques individually or in various sequences and combinations (which can be performed 

20 manually or automatically using macro features of the software application), a large number of 
resultant images can be created that represent the original image. The resultant image with the 
smallest compressed file size for each original image can then be selected for use in the 
electronic catalog. To further reduce the amount of memory required to store the compressed 
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image files, the binary file content of the compressed, resultant images selected can be 
concatenated into a concatenation file, a lookup table created for accessing the binary file 
contents within the concatenation file, and techniques, described hereinafter, can be used to 
extract and display images from the concatenation file. 
5 More specifically, there are numerous imaging techniques that can be used to prepare or 

manipulate the original images before compressing the manipulated image with a given image 
compression algorithm. It is to be understood, however, that not all imaging techniques will 
yield a smaller image size (when compared to the original image size) after the technique is 
applied and the resultant file compressed. Moreover, some techniques yield better results only 

10 for certain image types. Therefore, because the file size of a compressed, resultant file is not 

generally known before it is compressed, a plurality of different individual imaging techniques or 
combinations of imaging techniques may be applied to the original image to increase the chance 
that a specific imaging technique or combination of imaging techniques produces a compressed, 
resultant image that has file size that is smaller when compared to the compressed file size of the 

15 original image. By applying this "shotgun" approach when manipulating the image content, it is 
likely that several of the imaging techniques or combinations of imaging techniques create a 
smaller, resultant file size, when compared to the file size of the original image, for any given 
compression algorithm. By studying which techniques tend to work better, additional techniques 
can be created which magnify the effectiveness. 

20 By way of example, the original images can be manipulated by applying one or more of 

the following techniques, which include, but are not limited to: 

a) Rotating the original image by 90, 180, or 270 degrees and/or flipping the image - For 
images that contain a rotation or flipping that aided in creating a smaller compressed file, a 
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corresponding re-rotation in the opposite direction (or further rotation to achieve a total of 360 
degrees of rotation) or flipping is applied after the image is extracted and uncompressed from the 
concatenated file and before it is displayed on the device. In this way, it has the same orientation 
as the original product image. Especially with the GIF image format, it was an unexpected result 
5 that rotating the image 90, 180, or 270 degrees sometimes resulted in a large savings in file size. 

b) Changing the number of colors in the image — Graphic images consist of pixels. Each 
pixel, or "dot" on the screen or printer, consists of information about the color of the pixel. One 
byte of information can represent one of 256 different colors or one of 256 shades of gray. By 
adjusting the number of bits of information for each pixel, the number of color possibilities for 

10 each pixel can be affected. More bits allow for more choices of colors for each pixel. Images 
with 24-bit/pixel color (over 1 million possible colors for each pixel) are near photographic 
quality. Images with 8-bit/pixel color (256 possible colors per pixel) are common on the 
Internet. In so many computing applications there is a balance between things like speed, size, 
complexity, etc. Depending on the application, it is sometimes a good balance to have images of 

15 moderate quality (less bits per pixel) since the size of the image is often reduced, allowing the 
images to fit in a fixed memory device where higher-quality images would not all fit. By 
reducing the number of colors in an image but still meeting the minimal quality requirements of 
the application, a balance is struck between memory requirements (smaller files) and image 
quality. 

20 c) changing the image to grayscale if the image is murky and lacks vivid colors - In some 

applications, it might make sense to convert images that lack strong color content to grayscale. 
Grayscale images with only 4 bits/pixel (16 possible shades of gray per pixel) still look 
reasonable. Depending on the image format, it is possible to examine the header of the file to 
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determine what colors exist in the image. If some or most of the colors are gray (equal amounts 
of red, green, and blue) or are near grayscale, then a decision can be made to convert to, say, 4- 
bit grayscale from a 5-bit or higher color image. 

d) re-sampling the image at a different resolution - A technique used in music recording 
5 is re-sampling. Current commercial music CDs have 16-bit resolution. That is, 2 A 16 possible 
values for each sample of music. Each sample of music can be thought of as corresponding to a 
pixel, the smallest level of granularity. If a music CD is re-sampled (or converted) to, say, 12-bit 
resolution, then the amount of information is reduced 25%. Of course, some information is lost 
in the process which results in a degradation of the sound. However, 12-bit resolution in music 

10 roughly corresponds to FM radio quality, which is still very acceptable even after 25% of the 
original information is lost off the CD the DJ is playing. In this same way, an image can be re- 
sampled at a different (usually lower) resolution. Instead of the original 72 dpi (dots per inch), 
the image might be re-sampled at 71 dpi or 70 dpi. Of course, some information in the image 
will be lost. However, in some cases there is a reduction in complexity of the image from the 

1 5 compression- algorithm's point of view (e.g., GIF) and the image will compress better and store 
in less space. The effect of losing some information sometimes is not noticeable. Other times 
this loss of information manifests itself as a "softer" image that is not as crisp and the original. If 
the re-sampling resolution is chosen carefully, smaller images are produced that are still very 
usable for the application. 

20 e) sharpening the image — This is a common image manipulation technique that helps 

create more defined delineations between different color areas in the image. The result is that 
the image looks "cleaner" or "crisper " Depending on the image, this technique can make the 
image look better or worse (more cartoony) and can make the size of the resulting file increase or 
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decrease when compared with the original. This process can be repeated one or more times to 
the same image, further increasing or decreasing the size of the file. 

f) leaving the image as-is - Sometimes the original image has the smallest file size even 
after trying different techniques to reduce the size of the file or it is the only useable image for 

5 the intended application. 

g) changing the image contrast - Another technique for trying to reduce the size of the 
image is adjusting the contrast of the image. By slighting increasing or decreasing the contrast, 
the altered image can sometimes be smaller than the original and still be useful for the 
application. 

10 h) changing the image brightness - Another technique for trying to reduce the size of the 

image is adjusting the brightness of the image. By slightly increasing or decreasing the contrast, 
the altered image can sometimes be smaller than the original and still be useful for the 
application. 

i) changing the opacity of the image. 

15 When used in combinations, the order in which these techniques are applied to an original image 
can also be varied. It will be appreciated that these techniques are available as functions or 
sequences of functions that can be applied with an image/photo manipulation program like the 
PHOTOSHOP brand imaging software. Such imaging software may also provide for a macro 
facility where different sequences and combinations of these techniques can be recorded and 

20 applied in a batch mode. Each of these macros can then be applied across the set of original 
images in an automated manner. 

Because it is not possible to tell ahead of time which imaging technique (or sequence of 
techniques) will create the smallest compressed file size for any given original image, a plurality 



of various techniques (or sequences of techniques) can be applied to all of the original images. 
Performing the techniques in a batch mode is preferred in the case of a large number of original 
images. For example, applying 20 image manipulation macros to about 30,000 images yield 
about 600,000 images of varied file sizes, some larger than the original image, some smaller than 
5 the original image. By simple inspection (either manual or automatic), the smallest sized file for 
the resultant images corresponding to each of the original images is found by examining the 
output sets created by each of the selected image manipulation techniques (for example, the 20 
macros). This smallest sized of the compressed, resultant image files may then be selected and 
used in the concatenation method described below to collectively store the image files of choice. 

10 Alternatively, rather than select the smallest sized resultant image file that corresponds to each of 
the original images, the files in the output set having the smallest, collective compressed file 
sizes can be simply used (e.g., the one output set from the 20 output sets generated by the 20 
macros with the smallest, collective size is selected for use). It will also be appreciated that the 
smallest resultant file need not be selected (for example, the image quality may not be 

1 5 acceptable) and that the smallest resultant file having an acceptable quality may be selected 
(which may lead to the result that sometime the original image file is utilized). Preferably, it is 
known which techniques were employed to produce the selected resultant file (e.g., the resultant 
files are tagged or the resultant file set is tagged to provide an indication as to what manipulation 
technique(s) were employed to produce the results). 

20 Once the compressed, resultant image files that are representative of the original images 

are selected, the selected files are preferably stored in a concatenation file. In connection with 
the creation of the concatenation file, a look-up table is created that allows the image files to be 
individually retrieved. To this end, the look-up table may have the following fields: 
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original_file_name; starting_position (in the concatenation file); and length_ofJile. The look-up 
table may also include a field that describes what (if anything) should be done to a retrieved 
image file to allow the retrieved image file to be displayed correctly (i.e., what technique should 
be applied after the image file is uncompressed for display). For example, the file may need to 
5 be rotated or flipped or both if the selected file was created as a result of rotating or flipping the 
original image. 

To create the concatenation file and look-up table, the resultant files for each of the 
original files may be examined to determine which resultant file is the smallest. This file may 
then be selected for inclusion in the concatenation file and the file name can be placed in the 

10 field original_file_name of the look-up table (e.g., to allow for name retrieval of the image). At 
this time, the data of selected file corresponding to the original image under consideration can be 
concatenated to the current end of the concatenation file. The byte starting position of the data 
being concatenated to the concatenation file is recorded in the corresponding startingj>osition 
field of the look-up table and the length of the selected file (i.e., the number of bytes written to 

15 the concatenation file) may also be recorded on the same record in the length_of_file field of the 
lookup table. At this time, data can also be written to a technique field in the record of the look- 
up table to indicate the degrees the image was rotated and/or if the image was flipped when it 
was generated using the techniques described previously. For example, if the resultant image 
were subjected to a rotation of 90 degrees, it should be rotated 90 degrees in the opposite 

20 direction or another 270 degrees (in the same direction) when displayed so as to accurately 

reflect the orientation of the original image. Similarly, if the resultant image were subjected to a 
flipping, the image should be re- flipped when it is displayed. Concerning the rotation, the data 
stored in the technique field can indicate the rotation that is required to correctly orient the image 
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or the rotation that was performed on the resultant image from which the correction rotation can 
be easily determined. The resultant images for each of original image files may be processed in 
this same manner until there are no more original image files to consider using this process. 

As noted, when it is desired to display an image retrieved from the concatenation file, the 
look-up table is consulted to determine the starting location of the image file and the length of 
the image file. The data may then be read from the concatenation file using this information, the 
data uncompressed, and appropriate imaging techniques utilized (if needed) to then display the 
image which is understood to be a representation of the original image. Those of ordinary skill 
in the art will appreciate how to code or acquire stand alone applications for applying necessary 
imaging techniques to the uncompressed images based on information read from the look-up 
table to correctly display the retrieved image. 

From the foregoing, it will be appreciated that the system and method described herein 
allows more information to be stored in a memory of limited size. For example, the 
methodology allows for the reduction of slack space by creating one concatenated file (e.g., 
eliminating the slack space for nearly 30,000 images is a substantial saving in memory use). In 
this regard, slack space refers to file storage overhead. That is, memory space may be allocated 
in fixed, predetermined blocks such that more memory is allocated to data storage than is 
actually needed to store the data. By way of example, in the WINDOWS NT 4.1 brand operating 
system, a file that has 1 byte of information will actually take up 512 bytes of physical disk 
storage, i.e., 512 bytes is the predetermined block size. Percentage wise, smaller images tend to 
have a greater percentage of wasted slack space than larger files. However, as more and more 
images are present, even small amounts of slack space adds up to a large amount of wasted 
space, especially when storage capacities are relatively small and fixed. Thus, by creating fewer 
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files, ideally one, and concatenating the contents of each file, file by file, into these fewer 
concatenation files, almost all of the slack space can be eliminated. 

In addition, memory is conserved by creating a smaller image directory (e.g., by creating 
one concatenated file, the directory space for the corresponding 30,000 images is almost entirely 
5 eliminated). As will be appreciated, file header overhead is contained in the file directory and 
stores information like file modification time and date, file size, attribute information, etc. While 
this information is helpful in some cases, for large numbers of images, this adds up to a large 
amount of directory space. For certain application where files are more-or-less static, this 
information is not needed. By eliminating it, storage space can be reduced. Thus, it will be 

10 appreciated that, since there are only a few files which now contain the data from many files, the 
directory space has been reduced as well. 

Still further, the subject method and system provides faster deployment times (e.g., 
copying one large concatenated image file to a device is faster than copying 30,000 images to a 
device, as there is overhead with each file copied). By way of additional example, when utilized 

15 in connection with the 30,000 images that comprised a test file, the concatenation method 

without any image modification techniques applied yielded a file of about 1 10M. After carefully 
selecting techniques and sequences and combinations of techniques, the concatenation method 
had much smaller files to work with and created a file of about 69M (about 40% savings) 
without significantly affecting the value and usefulness of the images. Thus, utilizing the 

20 methods described herein, more data can accompany the images and applications can fit on more 
types of devices with less cost for memory. 

While specific embodiments of the invention have been described in detail, it will be 
appreciated by those skilled in the art that various modifications and alternatives to those details 
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could be developed in light of the overall teachings of the disclosure. Accordingly, the particular 
arrangement disclosed is meant to be illustrative only and not limiting as to the scope of the 
invention which is to be given the full breadth of the appended claims and any equivalents 
thereof. 
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